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Structural insights into SARS coronavirus proteins 
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The SARS coronavirus was identified as the pathogen of a 
global outbreak of SARS (severe acute respiratory syndrome) in 
2003. Its large RNA genome encodes four structural proteins, 
sixteen non-structural proteins and eight accessory proteins. 
The availability of structures of SARS coronavirus 
macromolecules has enabled the elucidation of their important 
functions, such as mediating the fusion of viral and host cellular 
membranes, and in replication and transcription. In particular, 
the spike protein fusion core and the main protease have been 
the most extensively studied, with the aim of designing anti- 
SARS therapeutics. Attention is now being focused on 
replicase proteins, which should enhance our understanding of 
the replication and transcription machinery. The structures and 
functions of most SARS proteins remain unknown, and further 
structural studies will be important for revealing their functions 
and for designing potential anti-SARS therapeutics. 
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Introduction 

In 2003, a previously unknown coronavirus termed SARS- 
CoV was identified as the causative agent of severe acute 
respiratory syndrome (SARS), responsible for a worldwide 
epidemic with approximately 800 deaths [1-4]. The most 
likely explanation for the emergence of SARS-CoV is 
animal-to-human interspecies transmission [5]. However, 
the animal reservoir for SARS-CoV in nature remains to 
be identified and the mechanism of viral adaptation to 
human hosts requires further investigation. 


SARS-CoV is a plus-strand RNA virus featuring a large 
single-stranded RNA genome of approximately 29 700 
nucleotides [6,7]. The genome 1s predicted to consist of at 
least fourteen functional ORFs that encode three classes 
of proteins: two large polyproteins, ppla and pplab, 


which are cleaved into sixteen non-structural proteins 
(nsps) required for viral RNA synthesis (and probably 
other functions); four structural proteins (the S, E, M and 
N proteins), essential for viral assembly; and eight acces- 
sory proteins, which are thought unimportant in tissue 
culture but may provide a selective advantage in the 


infected host (Table 1) [8]. 


In this review, recent structural studies of SARS-CoV 
macromolecules (including a conserved RNA motif) are 
summarised, focusing on those proteins that mediate the 
fusion of the viral membrane with the host cell membrane, 
or that are involved in coronavirus genome replication and 
transcription. The latter have been extensively studied, 
with the aim of designing anti-SARS therapeutics. 


Structural proteins 

The SARS coronavirus includes four structural proteins 
that are required to drive cytoplasmic viral assembly: the 
spike (S) protein, the membrane (M) protein, the nucleo- 
capsid (N) protein and the envelope (E) protein (‘Table 1) 
[6,7]. Here, we will focus on the S protein and N protein, 
whose partial structures have been solved. 


SARS spike protein fusion core 

Similar to other class I virus fusion proteins, the SARS- 
CoV S protein can be subdivided into an N-terminal half 
(S1) and C-terminal half (S2), but without proteolytic 
cleavage [9°]. S1 is responsible for variations in host 
range and tissue tropism according to its receptor speci- 
ficity, whereas S2 is responsible for cell entry following 
virus and host cell membrane fusion [10]. S1 is respon- 
sible for binding to cellular receptors; one potential 
SARS-CoV receptor has been identified as angiotensin- 
converting enzyme 2 (ACE2) [11]. S2 contains an internal 
fusion peptide and has two hydrophobic (heptad) repeat 
regions, designated HR1 and HR2 [12]. The putative 
fusion peptide has recently been identified upstream of 
and near to HR1 [13]. HR2 is located close to the 
transmembrane region, some 170 amino acids down- 
stream of HR1 [12]. The classical mechanism of envel- 
oped virus and host cell membrane fusion mediated by 
class I fusion proteins was established by Wiley and 
colleagues in their comprehensive study of influenza 
hemagglutinin (HA), with structures of the unprocessed 
precursor, the cleaved metastable HA1-HA2 heterodimer 
and post-fusion conformations available [14,15]. In the 
following years, extensive structural studies of the ortho- 
myxovirus, retrovirus, paramyxovirus and __filovirus 
families have led to a common fusion mechanism [15]. 
To confirm the value of fusion proteins as anti-viral 
targets, an HIV-1 membrane fusion inhibitory peptide, 


Current Opinion in Structural Biology 2005, 15:664-672 


www.sciencedirect.com 


Table 1 
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Summary of SARS proteins. 


Protein size 
(amino acids) 


Protein 
sequence) 


Structural proteins 


Spike (S) protein 1255 ORF2 (21492-25259) 
Envelope (E) protein 76 ORF4 (26117-26347) 
Membrane (M) 221 ORF5 (26398-27063) 
protein 
Nucleocapsid (N) 422 ORF9a (28120-29388) 
protein 
Non-structural proteins (Nsps) 
Nsp1 180 ORF 1a (265-804) 
Nsp2 638 ORF 1a (805-2718) 
Nsp3 1922 ORF 1a (2719-8484) 
Nsp4 500 ORF 1a (8485-9984) 
Nsp5 306 ORF 1a (9985-10902) 
Nsp6 290 ORF 1a (10903-11772) 
Nsp7 83 ORF1a (11773-12021) 
Nsp8 198 ORF1a (12022-12615) 
Nsp9 113 ORF 1a (12616-12954) 
Nsp10 139 ORF 1a (12955-13371) 
Nsp11 13 ORF 1a (13372-13410) 
Nsp12 932 ORF1b (13398-16166) 
Nsp13 601 ORF 1b (16167-17969) 
Nsp14 527 ORF1b (17970-19550) 
Nsp15 346 ORF1b (19551-20588) 
Nsp16 298 ORF1b (20589-21482) 
Accessory proteins 
Orf3a 274 ORF8a (25268-26092) 
Orf3b 154 ORF3b (25689-26153) 
Orfé 63 ORF6 (26913-27265) 
Orf7a 122 ORF7a (27273-27641) 
Orf7b 44 ORF7b (27638-27772) 
Orf8a 39 ORF8a (27779-27898) 
Orf8b 84 ORF8b (27864-28118) 
Orf9b 98 ORF9b (28130-28426) 


ORF (location in genome 


Putative functional Structure available? 


domain(s) 
Yes (S protein fusion core) 
No 
No 
Yes (N-terminal RNA-binding 
domain) 
No 
No 
Ac, X, PL2°"°, Y (TM1), ADRP No 
TM2 No 
mpre Yes 
TM3 No 
Yes? 
Yes* 
ssRNA binding Yes 
GFL No 
No 
RdRp No 
ZD, NTPase, HEL1 No 
Exonuclease (ExoN homologue) No 
NTD, endoribonuclease (XendoU No 
homologue) 
2'-O-MT No 
No 
No 
No 
Ig like Yes (luminal domain) 
No 
No 
No 
No 


? Structure has been deposited in the PDB, but has not been published. Ac, acidic domain; ADRP, adenosine diphosphate-ribose 
1’-phosphatase; ExoN, 3/-5’ exonuclease; GFL, growth-factor-like domain; HEL1, superfamily 1 helicase; MP'°, main (or 3C-like cysteine) 
protease; NTD, nidovirus conserved domain; NTPase, nucleoside triphosphatase; 2’-O-MT, S-adenosylmethionine-dependent ribose 


pro 


2’-O-methyltransferase; PL2 


‘T-20 (developed by Trimeris, Research Triangle Park, 
North Carolina, USA), which targets the prehairpin inter- 
mediate, was recently approved by the US Food and Drug 
Administration as a new anti-HIV drug [16]. 


In 2004, the structure of the S protein fusion core, con- 
sisting of the HR1 and HR2 regions, was determined by 
two groups in the post-fusion (or fusion-active) state 
[9°°,17°] (Figure 1). Xu and co-workers [17°] constructed 
a single chain by engineering a linker between the HR1 
and HR2 regions to prepare the fusion core (HR1: amino 
acids 900-948; HR2: amino acids 1145-1184), whereas 
Supekar and colleagues [9°*] individually synthesized 
longer HR1 and HR2 peptides to prepare the complex 
(HR1: amino acids 889-972; HR2: amino acids 1142- 
1185). Both structures exhibit a six-helix bundle in which 
three HR1 helices form a central coiled coil surrounded 


, papain-like protease 2; RdRp, RNA-dependent RNA polymerase; TM, transmembrane domain; X, Y, domains 
with unknown or hypothetical function; ZD, putative zinc-binding domain. 


by three HR2 helices in an oblique antiparallel manner. 
HR2 peptides pack into the hydrophobic grooves of the 
HR1 trimer in a mixed extended and helical conforma- 
tion; this represents a stable post-fusion structure, similar 
to that observed for HIV-1 gp41 [15]. The N terminus of 
HR1 and the C terminus of HR2 are located at the same 
end of the six-helix bundle, which would place the fusion 
peptide and transmembrane region close together. Supe- 
kar et a/. [9°°] also provided the structure of an S2 
fragment consisting of a smaller HR1 peptide (amino 
acids 919-949) and an HR2 peptide with extra C-terminal 
residues in proximity to the transmembrane region 
(amino acids 1149-1193) (Figure 1). The C-terminal part 
is a helical and points away from the HRI trimer axis, 
probably due to the lack of stabilisation by the corre- 
sponding HR1 region. The authors consider that this 
could mimic the conformation of this region before the 
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Figure 1 
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The SARS-CoV S protein fusion core. (a) Comparison of four ‘six-helix bundle’ structures. Shown from left to right are S protein fusion cores 
1IWYY [18], 2BEZ [17°], 1WNC [9°*] and 2BEQ [9°]. The central HR1 peptides are shown in ribbon representation, and are coloured red, blue 
and green. The HR2 peptides are shown in black. The N and C termini are labelled. (b) Comparison of four ‘HR1+HR2’ constructs, 
corresponding to the structures in (a). The labelled residues correspond to the start and end residues of the HR1 (red) and HR2 (black) peptides. 


(Figure adapted from [18].) 


formation of the final post-fusion hairpins. A later struc- 
ture was reported by Duquerroy and colleagues [18] 
(HR1: amino acids 890-973; HR2: amino acids 1145- 
1190) (Figure 1), in which they emphasized the hydro- 
gen-bonding network formed by conserved asparagine 
and glutamine residues, together with two possible chlor- 
ides, which could stabilise the conformation of post- 
fusion hairpins. 


Fusogenic mechanisms similar to those of other class I 
fusion proteins have been proposed for SARS-CoV 
[16°,18,19°]. However, understanding the possible con- 
formational changes of the fusion peptide, HR1 and HR2 
during the membrane fusion process needs further struc- 


tural studies of the native state of the S protein and the 
prehairpin intermediate that probably results from S1 
binding to a receptor (e.g. ACE2). 


Several peptides derived from HR1 and HR2 regions of 
SARS-CoV S proteins have been synthesized to block 
viral entry, targeting the putative prehairpin intermediate 
[17°,20,21]. Two groups discovered that only peptides 
derived from HR2, and not from HR1, inhibited SARS- 
CoV infection [17°,20]. Moreover, the efficacy of HR2 
peptides derived from SARS-CoV S protein is lower than 
that of corresponding HR2 peptides derived from murine 
coronavirus mouse hepatitis virus (MHV) in inhibiting 
MHYV infection [20]. Supekar and colleagues considered 
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that this might be due to the lower affinity of these 
peptides for the corresponding HR1 trimer [20], as a larger 
surface area is buried in the HR1-HR2 interface of MHV 
S2 than in SARS-CoV S2 [9°]. In any case, determination 
of the HR1-HR2 fusion core structure will help in the 
discovery of viral entry inhibitors against SARS. 


SARS nucleocapsid protein RNA-binding domain 

The N protein, which binds to the genomic RNA via a 
leader sequence, recognises a stretch of RNA that serves 
as a packaging signal and leads to the formation of the 
helical ribonucleoprotein (RNP) complex during assem- 
bly [22]. The structure of the RNA-binding domain of the 
SARS-CoV N protein was determined by NMR spectro- 
scopy in 2004 [23]. It consists of a five-stranded B sheet 
whose fold is unrelated to that of other RNA-binding 
proteins (Figure 2a). The authors identified a binding site 
for single-stranded RNA (ssRNA), using NMR to deter- 


Figure 2 


SARS coronavirus proteins Bartlam, Yang and Rao 667 


mine the resonance of residues perturbed by the addition 
of RNA, and revealed a similar mode of interaction to 
RNA-binding proteins such as U1A RNP. They also 
identified small molecules from an NMR-based screen 
that bind to the RNA-binding domain and might impair 
its function. 


Antigenic peptides of the coronavirus N protein can be 
recognised by T cells on the surface of infected cells 
[24,25]. ‘The structure of the MHC-I molecule HLA- 
A*1101 in complex with such a peptide derived from 
the SARS-CoV N protein, a nonamer with a SARS-spe- 
cific sequence, has recently been determined to 1.45 A 
resolution [26]. Although it is similar to other MHC-I 
molecules and shows a similar peptide-binding mode, the 
structure adds to the growing library of MHC-I structures 
and could be used as a template for peptide-based vaccine 
design. 


(a) 


“ 


(c) (d) 


Cc 


Current Opinion in Structural Biology 


(b) 


Other structures of SARS-CoV proteins. (a) Solution structure of the N-terminal RNA-binding domain of the SARS-CoV N protein (PDB code 
1SSk). (b) X-ray crystal structure of nsp9, an ssRNA-binding protein (PDB code 1UW7). (c) X-ray crystal structure of the accessory protein 
Orf7a (PDB code 1XAk). (d) X-ray crystal structure of the s2m, a rigorously conserved RNA element of the SARS-CoV genome. 
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Non-structural proteins 

The SARS-CoV replicase gene encodes 16 nsps with 
multiple enzymatic functions (Table 1) [27]. These are 
known or are predicted to include types of enzymes that 
are common components of the replication machinery of 
plus-strand RNA viruses: an RNA-dependent RNA poly- 
merase activity (RdRp, nsp12); a 3C-like serine protease 
activity (MP? or 3CL?, nsp5); a papain-like protease 2 
activity (PL2°, nsp3); and a superfamily-1 helicase 
activity (HEL 1, nsp13) [6,28,29°°]. In addition, the repli- 
case gene encodes proteins that are indicative of 3’-5’ 
exoribonuclease activity (ExoN homologue, nsp14), 
endoribonuclease activity (XendoU homologue, nsp15), 
adenosine diphosphate-ribose 1’-phosphatase activity 
(ADRP, nsp3) and ribose 2’-O-methyltransferase activity 
(2'-O-MT, nsp16) [27]. These enzymes are less common 
in plus-strand RNA viruses, and may therefore be related 
to the unique properties of coronavirus replication and 
transcription. Finally, the replicase gene encodes another 
nine proteins, of which little is known about their struc- 
ture or function. The nsps 4, 10 and 16 have been 
implicated by genetic analysis in the assembly of a func- 
tional replicase-transcriptase complex. Here, we detail 
two nsps with available structures, of which nsp5 is the 
most extensively characterised. 


Nsp5: the main protease (MP’° or 3CL""°) is a target 

for anti-viral drug design 

The replicase polyproteins, ppla and pplab, undergo 
extensive proteolytic processing by viral proteases to 
produce multiple functional subunits, which are involved 
in the formation of the replicase complex that mediates 
viral replication and transcription. The coronavirus main 
protease (MP"°), also known as the 3C-like protease 


Figure 3 


(3CL?"°) after the 3C proteases of the Picornaviridae, is 
an +33 kDa cysteine protease that cleaves the replicase 
polyprotein at 11 conserved sites with canonical Leu- 
Gln|(Ser, Ala, Gly) sequences. The cleavage process 1s 
initiated by the enzyme’s own autolytic cleavage from 
ppla and pplab [30,31]. Its functional importance in the 
viral life cycle and the lack of closely related cellular 
homologues make MP" an attractive target for the devel- 
opment of drugs directed not only against SARS but also 
against other coronavirus infections [29°°,30-32]. 


The first structural models of SARS MP"? were homology 
models built from the crystal structures of MP from 
human coronavirus strain 229E (HCoV-229E) and por- 
cine transmissible gastroenteritis virus (TGEV), both 
group I coronaviruses. These homology models were 
widely used in the design of anti-SARS inhibitors 
[32,33]. In 2003, shortly after the peak of the SARS 
outbreak, Yang ef a/. reported the first crystal structure 
of SARS-CoV MP" to 1.9A resolution, which provided 
important structural information for rational drug design 
(Figure 3) [29°]. CoV MP?"° forms a dimer and each 
protomer consists of three domains: domains I and II 
resemble chymotrypsin, whereas domain III has a glob- 
ular cluster of five mostly antiparallel a helices. The cleft 
between domains I and II is the location of substrate 
recognition and catalysis. Domain III and the additional 
N-terminal finger of domain I (or ‘N-finger’) are consid- 
ered to influence the dimerisation and activity of MP 
[29°°, 34-36] (Figure 3). However, one group has reported 
incongruent results concerning the role of the N-finger in 
dimerisation [37]. In contrast to common serine pro- 
teases, which have a catalytic triad, CoV MP?" has a 
Cys-His catalytic dyad. An exceptionally stable water 


(a) 


N-finger 


Inhibitor 
es = Inhibitor 


(b) 


N-finger 
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Nsp5, the SARS-CoV MP'°. (a) The crystal structure of SARS-CoV MP" in complex with a CMK inhibitor (PDB code 1UKW). Protomers A and 

B are shown in ribbon representation, and are coloured red and blue, respectively. The CMK inhibitors are shown in yellow stick representation. 
The N-finger, residues 1-7 of protomer B, is shown in green. A transparent molecular surface is shown covering the structure. (b) Schematic 

of the SARS-CoV MP dimer, corresponding to the view in (a). Residue S1 on the N-finger of protomer B forms hydrogen bonds with two 


residues in protomer A, F140 and E166. 
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molecule occupies the position of the usual third member 
of the triad, which might stabilise the protonated histi- 
dine in the intermediate state during proteolytic clea- 
vage. 


As a prelude to inhibitor design, the structure of SARS- 
CoV MP in complex with a substrate analogue (a chlor- 
omethyl ketone [CMK] inhibitor, Cbz-VNSTLQ-CMK) 
was determined in 2003 (Figure 3). The sequence of this 
substrate analogue was derived from residues P6—-P1 of 
the N-terminal autoprocessing site of TGEV MP?° [32]. 
However, the two protomers of SARS-CoV MP" each 
exhibited an unexpected binding mode. This most prob- 
ably resulted from the comparatively weak binding of 
peptidyl elements derived from the substrate of TGEV 
MP and from the highly reactive electrophile CMK, 
suggesting that nucleophilic attack might have occurred 
before a stable non-covalently bound enzyme-inhibitor 
complex was formed [38°°]. 


Following the SARS outbreak, a series of potential inhi- 
bitors against SARS-CoV MP" was reported, some of 
which could prevent viral replication 7” vitro [39-43]. 
However, complex structures are rarely available to guide 
further modification of these compounds. A recent study 
of representative structures from all three groups of the 
genus Coronavirus has indicated that all CoV MP"? share a 
highly conservative substrate-recognition pocket [38°*]. 
Mechanism-based irreversible inhibitors were designed 
based on this conserved structural region, and further 
modification of these compounds could possibly lead to 
the discovery of a single agent with clinical potential 
against existing and possible future emerging CoV- 
related diseases [38°°]. 


Nsp9: an ssRNA-binding protein 

Crystal structures of nsp9, determined simultaneously in 
2004 by Egloffera/. (to 2.7 A resolution) [44°*] and Sutton 
et al. (to 2.8A resolution) [45°°], have established its 
previously unknown function as an ssRNA-binding pro- 
tein. Both groups report that the biological unit is a dimer. 
The core structure of the protein is an open six-stranded B 
barrel reminiscent of, although unrelated to, the nucleic 
acid binding OB (oligosaccharide/oligonucleotide-bind- 
ing) fold (Figure 2b). Instead, nsp9 is structurally homo- 
logous to certain subdomains of serine proteases, most 
notably domain II of SARS-CoV MP"°. Based on this 
similarity to the picornavirus 3C proteases, which feature 
a conserved RNA-binding motif, it was inferred that nsp9 
should bind ssRNA; this was subsequently confirmed by 
electrophoretic mobility shift assays (EMSAs) [45°*] and 
surface plasmon resonance [44°*]. One role of nsp9 may 
be to stabilise nascent and template RNA strands during 
replication and transcription, and to protect them against 
nuclease processing. Besides replication, nsp9 may also 
be involved in base-pairing-driven processes, such as 
RNA processing. 
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In addition to their nsp9 structure, Sutton and colleagues 
showed evidence of its interaction with nsp8 [45°°]. 
Furthermore, dual-labelling studies of SARS-CoV repli- 
case proteins have demonstrated co-localisation of nsp8 
with nsp2 and nsp3 [46], and an interaction between nsp7 
and nsp8 has also been found (Z Rao, unpublished; see 
Update), suggesting that the nsps assemble to form a 
sophisticated viral replication/transcription machinery. 
Nsp9 is the first component of the complex with an 
available three-dimensional structure, providing a starting 
point to reveal the architecture and underlying functions 
of the replication/transcription complex. 


Accessory proteins 

The genomic sequences of numerous SARS-CoV isolates 
have been determined. The ‘conserved’ open reading 
frames (ORFs) of the SARS-CoV genome occur in the 
same order as and are of similar size to those found in 
other coronaviruses. However, in addition to the con- 
served genes, the SARS-CoV genome contains eight 
novel ORFs at the 3’ end (ORFs 3a, 3b, 6, 7a, 7b, 8a, 
8b and 9b) (‘Table 1) [27]. To date, the functions of these 
genes remain largely unknown, although their absence 
from other genomes suggests unique functions that might 
be advantageous to SARS-CoV replication, assembly or 
virulence [8]. Only one of these so-called accessory pro- 
teins has a known structure and further studies are 
required to elucidate their precise functions. 


The Orf7a accessory protein 

Sequence analysis predicted that ORF 7a encodes a type 
I transmembrane protein of 122 amino acids, consisting of 
a 15-residue N-terminal signal peptide, an 81-residue 
luminal domain, a 21-residue transmembrane segment 
and a 5-residue cytoplasmic tail [27]. The Orf7a sequence 
has been identified in all isolates of SARS-CoV collected 
from both human and animal sources, but it appears to be 
unique to SARS, with no significant similarity to any other 
viral or non-viral protein. The structure of the luminal 
domain of the Orf7a accessory protein was determined 
earlier this year to 1.8 A resolution. It reveals a compact 
Ig-like domain with a fB-sandwich fold topology 
(Figure 2c), despite its unusually small size and lack of 
significant sequence similarity to other members of the Ig 
superfamily [47°]. This common structural fold occurs ina 
wide variety of proteins, where it performs a diverse set of 
functions, making it difficult to predict the functional role 
of Orf7a from the structure alone. For example, the fold is 
found in proteins of the extracellular matrix, muscle 
proteins, proteins of the immune system, cell surface 
receptors, enzymes, transcription factors and a wide vari- 
ety of viral proteins [48]. 


Other structures 
The crystal structure of the stem-loop II motif (s2m) 
RNA element of SARS-CoV was determined in 2005 to 


2.7 Aresolution [49°*]. s2m is a rigorously conserved motif 
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located at the 3’ end of SARS and other coronaviruses, as 
well as astroviruses [50]. The highly structured s2m RNA 
element includes a remarkable 90° bend of the helix axis 
(Figure 2d). Several novel longer-range tertiary interac- 
tions create a tunnel perpendicular to the main helical 
axis, whose interior is negatively charged and binds two 
magnesium ions. These unusual features form probable 
surfaces for interaction with conserved host cell compo- 
nents or other reactive sites required for virus function. 
An interesting observation is the possible mimicry by s2m 
RNA of an rRNA fold, the 530 loop of 16S rRNA [51]. 
This implies a mechanism for RNA hijacking of host 
protein synthesis in SARS, similar to that observed in 
other RNA viruses [52]. The 530 loop of the 30S ribosome 
binds to prokaryotic proteins $12 and IF-1, further sug- 
gesting that s2m may interact with their eukaryotic 
homologues [49°°]. Nevertheless, the high sequence con- 
servation of s2m in an otherwise rapidly mutable RNA 
genome implies its pathogenic importance and signals 
that it could be another attractive target for the design of 
anti-viral therapeutics. 


Conclusions 

The rapid growth in the availability of SARS-CoV protein 
structures since the 2003 outbreak has emphasized the 
importance and strength of structural biology as a tool 
to address significant health-related issues, including 
functional annotation of proteins and identification of 
important drug targets. An important wealth of informa- 
tion and clues for further study have been accumulated 
from the SARS-CoV macromolecular structures deter- 
mined so far. The first SARS-CoV structure to be deter- 
mined, M?", is an important target for drug design 
and has been widely used since 2003 as a basis for 
inhibitor design, with promising results. Similarly, the 
S protein fusion core has also been confirmed as an 
important drug target for the design of fusion inhibitor 
peptides. Future prospects for SARS structural biology 
include the structures of replicase proteins alone and in 
protein-protein complexes, with the aim of understand- 
ing the sophisticated function and assembly of the repli- 
cation/transcription machinery, as well as_ the 
characterisation of the structural interaction between 
the SARS-CoV S protein and its possible cellular recep- 
tors, for instance, ACE2. 


Update 

A number of structures of SARS coronavirus proteins have 
recently been published, including the structure of the 
SARS coronavirus spike receptor-binding domain (RBD) 
in complex with the receptor ACE2, determined by 
Harrison and colleagues [53°°]. The authors reveal that 
the interface between the two proteins shows important 
residue changes that facilitate efficient cross-species 
infection and human-to-human transmission, and suggest 
ways to make truncated disulfide-stabilised RBD variants 
for use in the design of coronavirus vaccines. 


The work referred to in the text as (Z Rao, unpublished) 
is now in press [54°*]. The crystal structure of the hex- 
americ complex between nsp7 and nsp8 to 2.4 A resolu- 
tion provides the first insight into the sophisticated 
architecture of the replication and transcription machin- 
ery. The supercomplex is a unique, hollow, cylinder-like 
structure assembled from eight copies of nsp8 and held 
tightly together by eight copies of nsp7. The central 
channel has dimensions and positive electrostatic proper- 
ties favourable for nucleic acid binding, implying that its 
role is to confer processivity on RdRp. The structure of 
nsp7 has also been determined in the free unbound form 


by NMR [55]. 


We are also aware that the crystal structure of the ADRP 
domain of nsp3 has been determined and is currently in 
press [56]. 
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